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Abstract 

We consider noisy binary channels on regular trees and introduce peri- 
odic enhancements consisting of locally self-correcting the signal in blocks 
without break of the symmetry of the model. We focus on the realistic 
class of within-descent self-correction realized by identifying all descen- 
dants k generations down a vertex with their majority. We show that this 
also allows reconstruction strictly beyond the critical distortion. We fur- 
ther identify the limit at which the critical distortions of within-descent 
k self-corrected transmission converge, which turns out to be the criti- 
cal point for ferromagnetic Ising model on that tree. We finally discuss 
how similar phenomena take place with the biologically more plausible 
mechanism of eliminating signals which are locally not coherent with the 
majority. 
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1 Introduction 



We consider a binary channel on a regular tree, as in [T|, with a distortion 
rate s > at every transmission and are interested in the reconstruction of the 
starting bit do from the signals aw„ at the n-th generation of the tree. We 
focus on the majority rule, by which oo is reconstructed as the symbol having 
majority in aw n ■ 

In [1] it is shown that for regular trees the majority rule is asymptotically 
equivalent to the optimal maximum-likelihood rule, and that there is a critical 
distortion e c — :s ^=r such that for e > e c no asymptotic reconstruction takes 
place and for e < e c there is asymptotic reconstruction; see also [11] for a review 
and [12] for a dynamical version of these results. 

The aim of this paper is to investigate how a non-symmetry breaking mech- 
anism of correction performed while transmitting the signal can improve recon- 
struction by either majority or maximum likelihood. To this purpose, we pro- 
pose a local self-correction method by which the signal is periodically enhanced 
in blocks formed within the generations. The enhancement uses majority rule 
and consists of taking all signals in a block and changing them to all agree 
with their majority value (with random choice to break tie). The self-correction 
is based on the information available at the level of interest, and thus can in 
principle be performed while the signal is transmitted. From every vertex the 
transmission is then continued as it used to be in the original mechanism and 
the symmetry of the model is not broken. 

It is easy to see that with non-local enhancement one can reconstruct beyond 
the critical distortion: in fact, by forcing all vertices of each generation to agree 
with their majority, one can reconstruct for every e £ [0, 5). However, such 
correction involves taking majority on larger and larger blocks, which is not an 
implementable strategy. 

A slightly less expensive self-correction strategy consists of using blocks of 
fixed size M (as soon as the generation is large enough) and then performing 
self-correction at every generation. In section 2 we show that for any noise level 
£ < \ it is possible to achieve reconstruction in this way with sufficiently large 
block size M . This procedure has the advantage of involving only a bounded 
number of within generation information exchange in self-correcting a block, 
and thus could in principle be implemented by a real machine. However, it still 
involves a very large number of within generation operations, performed at each 
generation: if the cost of each such operation is not zero (as in basically all 
reasonable situations) then the total cost might become too high. 

We, therefore, restrict our attention, in the sequel, to a self-correction mecha- 
nism which contains costs by performing self-correction less often, and which has 
the additional advantage of being performed within the descent of some signal 
involved in the previous correction. This within descent self-correction reduces 
implementation costs, and allows signals to be dispersed and loose contact after 
their involvement in the enhancement, a feature which could be meaningful in 
a realistic setting. The within-descent self-correction at level k is performed by 
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taking each vertex at some Ik-th generation, I <G N, considering its r k descen- 
dants k generations down, and then changing them to agree with their majority 
(randomly breaking ties). 

At first sight, it is not even obvious that such reconstruction improves upon 
the non self-corrected transmission, but in section 3 we show that, except for k = 
1 and r = 2, the within-descent self-correction at level k strictly increases the 
critical distortions, and thus is an effective enhancement. The proof is based on 
the comparison between the self-correction based on the majority transformation 
with one correction based on random transformation which leaves the critical 
points unchanged. 

The rest of the work is devoted to identifying the limit of the critical distor- 
tions of the within-descent self-correction of level A; as fc diverges. Although it 
might seem that such mechanism is almost useless for large k, it turns out that 
instead it improves the transmission further. 

To identify the large k limit, in section 4 we exploit the correspondence with 
the Ising model. In fact, it is easy to see that, for regular trees, the reconstruc- 
tion problem is equivalent to the free boundary conditions phase transition of 
the ferromagnetic Ising model on the tree with inverse temperature /3 such that 
1 — 2e — tanh(/3). Such transition occurs at the critical inverse temperature (3 C 
such that for (3 > (3 C the free boundary Ising model is convex combination of 
the extremal states (see pQ for a detailed description). On the other hand, the 
Ising model undergoes its regular phase transition (with boundary conditions) 
at a lower inverse temperature (3 C < (3 C . In terms of p — tanh(/3) and on a reg- 
ular tree with forward branching rate r, we have p c = tanh(/3 c ) = - (as shown 
originally in [8]) and p c = tanh(/3 c ) = -j= (as shown in (21I31BJ)- 

Our self-correction at level k introduces thus new critical values 1 — 2e c (fc) = 
Pc(k) — tanh(/3 c (fc)) < p c and our main result is a bound on p c (k) showing 
that limfc^oo p c (k) = p c , the regular Ising model phase transition point. Such 
estimate is derived by introducing the FK representation of the Ising model and 
then comparing the information carried by the FK tree of the origin against 
the external "noise" produced by all other freely fluctuating clusters of vertices. 
We think that this comparison, which is based on Gaussian approximation and 
large deviation techniques, has an interest in itself as it gives a very natural way 
of evaluating the information available on the tree. 

In section 5 we remark that the majority self-correction is not biologically 
feasible, and introduce, instead, a minority removal self-correction which con- 
sists of self-correcting a generation by removing the elements not belonging to 
the majority. Since this leaves at least r fc /2 descendants, nothing really changes, 
and such correction also improves upon normal reconstruction up to the Ising 
model critical point. As we discuss, this, however, seems to indicate a peculiar 
phenomenon: it looks like that accepting the risk of creating uniform incorrect 
regions ("tumors") increases the resistance of inheritance to distortion. Whether 
this is a biologically meaningful statement should be further investigated with 
many bits models and realistic parameters. 

There remain several open issues. First of all, our bounds on p c {k) in section 
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4 are not sharp. Also, our analysis has been performed either for correction 
each k = 1 steps using large block size M or for correction every k steps with 
M = r k : we do not deal with the generic case of correcting blocks of size M each 
k generations. Solving the two issues above would then allow to treat the main 
open problem left by the present work: if one is to reconstruct the signal at a 
fixed generation n and if within generation transmission has some given cost, 
it would be natural to introduce a correspondence between within generation 
transmission costs and gain in reconstruction probability, and then look for the 
self-correction algorithm with optimal k and M. 

2 Large Block Reconstruction 

We consider regular trees T^with forward branching rate r > 0. The n-th 

(r) (r) 

level of the tree is indicated by T„ and T-Un represents the tree up to and 
including the n-th level. Vertices v of are then identified by coordinates 
v = (n, s) where n is the level and s = 1, r n numbers the vertices at the same 
level. Signals or configurations are variables {(tJ^j-h, u v e { — 1, 1}, and their 
distribution is specified by taking e > 0, P e (<7o = 1) = 1/2 and for each vertex 
v and predecessor ^v, P E (cr v = cr^ v ) = 1 — £ independently of all other pairs. 
Reconstruction under majority rule on (T^ r \P E ) takes place if 

< liminf A n (P E ) = : liminf (P E (S n > 0\a = 1) - P e (S n < 0\a = 1)) 

n n 

= liminf E E | P e (<j - l|S n ) - P e (<t = -l\S n )\ 

n 

(1) 

where S n = J2 veT (r} ° v - 

We first consicfer self-correction performed at each step using large blocks. 
We fix an integer M > and let n = max{fc : r k < M}. We then consider 
the n-th generation as block 0, and partition each of the following generations 
into blocks of size M as follows: vertices v — (n, s) <G T.^ are partitioned into 
\Jm\ blocks of vertices with consecutive coordinates s, and possibly one block 
of r" — L§j"_|M vertices, which is from now on discarded without affecting the 
argument which follows. Each block B is then connected to all blocks B' such 
that there are two vertices v € B and v' e B' which are connected on . One 
can easily see that considering blocks as renormalized vertices and connections 
between them as renormalized bonds we have a new tree with forward 

~(r) 

branching r at all vertices v € T„ , n > 1, and branching rate tq < r at the 
starting vertex vq. The branching rate of T^ r > is thus again r. 

Next, we consider self-corrected variables, which are required to be constant 
on blocks: 

Sm { — 1, 1} T< ' such that a v is constant on each block}, (2) 
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and the self-correction map $m : { — 1> 1} T r - » S M defined by 



e b 



and E^es' 7 " ^ 



(h-i) 



if t> e B c TW\T^ { ) S _ 1) 
and E«gb CT « = 



if w e r 



(r) 
>(«-!)' 



(3) 



where Z £ { — 1, 1} is a symmetric random variable. 

The transmission is then self-corrected by the map <I>Mat every step: cr m 

-'-(n-l) 

Em generates ct t m € { — 1, l} T<r> as usual, and then we take <I>m (c T (r) ^ € E M - 
The distribution P £j m of the self-corrected configuration is then recursively de- 



fined by P e ,M o 



->(«-!) 



= Pr ( *M ■ <T T (r) 



We then take configurations on the renormalized tree to be a© if i; rep- 
resents the block B and {^m<j) v — ct s for all v £ B, and indicate by ■ 
Em — * Em, with Em = {^v,v € T( r )} = {—1, 1}^' the renormalizing trans- 
formation. Renormalized configurations are described by P £j m = ^ m ° Pt.m on 
(the Borel cr-algebra of) Em- 

Our first result is that, no matter how large the noise level e £ [0, 5) is, with 
large enough block size M it is possible to reconstruct the starting signal er 
after performing the M-block self-correction at each step. 



Theorem 2.1 Ve £ [0, |) 3M : VM > M 



liminf A„(P £jM ) > 0. 



Proof. We first calculate the error rate sm on the renormalized tree T^: let 
B be any block of size M of direct descendant of some site v' £ B' , where B is 
a descendant of B' in p( r ); then 

SM = Pe(J2 a - < °K' = !) + = = X )- W 

Given a v >, the cr„'s are {— 1, l}-i.i.d. random variables with P £ (cr v = l|ov = 
1) = 1 — £ > 2> so that by large deviations theory there exists c e > such that 
£m < er c '- M for all M > 0. Therefore, for M large enough, 
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(1 - 2s M yr > (1 - 2e~^ M Yr > 1. (5) 

This implies that e~m < £c and there is reconstruction on the renormalized tree 
f( r \ By [1] this implies that for such M's: 



liminf P e .M(^o = 1| J! ^ > 0) - P e ,jjf (a = -1| 51 ^ > 0) J > 0. 

(6) 

Now, cr = 1 if X)„ eT M CT « > or ' witn probability |, if X^ 6T < r) °^ = °- 
Therefore, 

/ \ 

liminf P e ( ^ > 0| £j ct„ > 0) - P e ( ^ ct„ < 0| 51 *e > 0) > 0. 

(7) 

- (r) 

We now show that by reading the block variables tr B for v € T„ one can 
reconstruct ctq. To this purpose let 



and 



A = {a = +l}, 
B = { J2 °v > 0} 

»erj p > 



C = { 5] <r e > 0}. 



(8) 

We then have, by total probabilities theorem, the Markov property and the 
fact that P(A\B C ) = P(A C \B) (with the same equality when A and A c are 
exchanged) , 



P e (A\C) - P £ (A C \C) 
= P £ {A\C n B)P £ {B\C) + P £ {A\C n B C )P £ {B C \C) 
-{P £ {A C \C (1 B)P £ (B\C) + P £ (A C \C n B C )P £ (B C \C)) (9) 
= P £ (A\B)P £ (B\C) + P £ (A\B c )Pe(B c \C) 
-(P £ (A C \B)P £ (B\C) + P £ (A c \B c )Pe(B c \C)) 
-- (P e (A\B) - P £ (A C \B))(P £ (B\C) - P £ (B C \C)) > 0; 
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the last inequality holds since it follows from J7|) that if M is large enough, 
liminf„(P s (S|C) — P e (B c \C)) > 0, and it follows from the next Lemma that 
P S (A\B) - P S (A C \B) > for every n. U 

Lemma 2.2 Consider any tree T (r) and a transmission problem described by 
the distribution P e , let S n (a) = S„ = X^ e x (r) <7v ' Then 

i) 

P e (S„-l > 0\S n > 0) - Pe(S n -i < 0\S„ > 0) > 

ii) 

P £ {S n >Q\D)-P e (S n <0\D) >0 

for every D C { — 1, l} 7 ™- 1 such that Vcr G D, S n -i(a) > 0. 
iii) 

P e {S n > 0\a T ir)J - Pe{S n < 0\a T M) > 

for every configuration a^r) £ {—1, l} 1 ™- 1 such that Y] ^(r) a v = I > 0. 
iv) 

P,(S n - k > 0\S n > 0) - P e (S n -k < 0\S„ > 0) > 

for every k = 1, n. 

Proof. Clearly ii) implies i) taking D = {S n -i > 0}, and iii) implies ii) since 



P e (S n >0\S n -i> 0) 

J2 MSn > 0|a T („ i )P e ('7 T M i \S n -l > 0) 



(10) 



To show iii) assume c t<*o iJt, = Z > 0. Then 5„ = Ei=i 2 -Xi+S ^-i-i 

fe l = 2 

YCrn-i-i+i X i with X, i.i.d, i.i.d, X l ,Y l G {-r,r} and X t = £j =1 Xj, *j 
i.i.d, X, G {-1, 1}, = 1) = l-e and Y t = ££ =1 Y,-, Yj i.i.d, Y, G {-1, 1}, 

P(Y,- = 1) = e, all these variables being independent. So Xi is distributed like 
5i conditioned to Co = 1 and, by symmetry of the distribution of Si, Xi = d — Yj, 
so that 

S n = X,+ K < ( n ) 
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is a symmetric random variable. Therefore, 



P £ (S n >0\a T (r )i ) = P e (S n + ^ X, > 0) 

i— r n-l_l + l 

= P e(Sn+ J2 Xi > 0\S n = h)P e (S n = h) 

, r - 1_, T r n-i 
= E P ^ E ^ > -il|Sn = il) 



ii>0 



+P £ ( ^ X j >/i|5 n = -ii) 



j= r n.-l_/+l 



Pe{Sn — h) 



+ P £ ( J2 X i >0\S n = 0)P e (S n = 0) 



E 



i>0 



p e (J2^ > -h) + p e (j2^ > h) 



i=i 



i=i 



Pe(Sn = h) 
+ P £ (Y / X i>°) P e(Sn = 0) 

(12) 



i=l 



By the analogous expression for S n < we then need 



p e (52x i >-h)+p e (22x i >i 1 ) 

i=l i=l 

i / 

>P £ (^X i < -l^+P^Xi <h) (13) 



1 i=l 



For every I > 1 and > 0, we have J2i=i Xi — Sj=i -^j an d 

rl rl 



Also, by the change of variable rl — h' = h, 



P B (£,X j >l 1 ) = E ( * 



(14) 



j' = l h'=0 
rl 



w + ii 



e ( )d- e ) rt -v 



(15) 



So that, for e < \, 



P e (E -Xi > ii) ~ Pe(J2 X * < ~h) 



i=l 



, _H+Ii 



E ( ? ) (1 - e) rl ~ h e rl - h ((l - ef h ~ H - e 2h ~ rl ) > 0. 



(16) 

This shows (fT3|l since we have seen one strict inequality between two terms, and 
the other two terms satisfy 



p E (j2x l >-i' 1 )~p £ (j2x i <i[) 

i=l »=1 

I I 

= p £ (E x - ^ - p tCE x - ^ > 



i=l i=l 

(17) 

for the same inequality (fl6|l applied to Zi = — 1 > 0. 

Finally, (iv) is shown using iteratively ([6]) for k larger than one with 

A = {5„_ fc >0} 
B = {S n ~ k+1 > 0} 

and 



C = {S„>0}. 
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3 Within- descent self-correction: strict inequal- 
ity of critical points 

Our aim is to consider within-descent self-correction at some level k. To this 
purpose we take a vertex v in some generation mk, m <G N, and look at its r k 

(r) 

descendants k generation down (thus in P( m+1 ) fc ) as generated by the transmis- 
sion; we then force all such descendants to agree to their majority (with random 
choice if there is no majority). Transmission is then resumed as usual from the 
modified status. This amounts to define a map : { — 1, 1} T< ' — > Sfe given by 



$fc(cr)„ = < 



1 with probability 1 if YZ'=i a mk, si r k +s' > 



-1 with probability 1 if =1 <J m k, Sl r"+s' < 



1 with probability 1/2 
-1 with probability 1/2 



h° 5^s'=l ^mk,s\r k +sL 



(18) 

if v e T%1, with « = (mfc,sir fe + s 2 ), si - 0, r^™" 1 ) - 1, s 2 = l,...,r fc ; 
otherwise 



= <7„. 



(19) 



As before, the transmission is self-corrected by the map every k steps: 
<T ( r ) € Sfe generates cr T (r) € {— 1, 1} T< ' as usual, and then we take 



(k) 

) € Sfe. The distribution P e v of the self-corrected configuration 



'-.(m+l)* 

is then recursively defined by 



-to 

-.(m + l)fc 



|<7 T (r) )=P e ($ fc 1 (7 T (r 



(m+l)*A T - 



r) |CT T (r) )• 



(20) 



(k) 

Notice that P e is no longer a Markov chain but the conditional probabilities 
satisfy 



Ps k ^( a T {r) \ a T {r > ) = Pe( (7 T ir > \ a T (r > ) 

" ->(n-l) n -.(n-1) 



= P £ (f7 T (r) |C7 (r) ) 



'(n-l) 



(21) 

for all n not of the form n = mfc. 
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Next, for a e £ fe , let * fe (cr) € T^") be defined by 



^k(cr)v = o"( mfeiSir * + i) (22) 

if w € ri rfc) , t; = (m,sir fc + s 2 ), si = 0, r^™- 1 ) - 1, s 2 = l,...,r fc . Note that 
ty k (cr) is a configuration of an almost regular tree T^ r ); T^ r ) has branching 
rate 1 at the starting vertex and then r fe at all other vertices. As we will see, the 
initial segment makes no difference in our arguments, and, therefore, we adopt 
the slight abuse of notation T^ r ) (which in our definitions indicates a regular 
tree). 

(k) 

Using Pg we define the self-corrected critical distortions 

e cr (k) = sup{e : liminf A„(P £ (fe) ) > 0}. (23) 

n 

Note that on * fc ($ fc ({-l, l} T<r> )) = T^ k) the distribution ^ k {P^ k) ) = 

(k) —1 (k) 

Pe ■ ^f. is a Markov chain, by the definition of P e , and thus it is again 

a transmission model with error rate e(k). In other words, ^(pj*^) = P e (k) on 
T (r k )_ 

We first show that reconstruction under ^ k (P £ k ^) on T^ r ' is equivalent to 

(k) 

reconstruction under the fc-self corrected distribution P £ . 

Lemma 3.1 liminf „ A„(P £ (fe) ) > if and only if liminf n A n (tf fc (P e (fc) )) > 

Proof. First, observe that liminf „ A n ($> k (P^)) > on * fe (S fc ) if and only if 
liminf „ A n (tf fc (P e (fc) )) > on T {r "\ In fact, on tf fc (E fe ) we obtain 

P e{k) {S n > 01(70 >0) 

= liminf [(1 - e(k))P e{k) (S n > 0|<7 (1;1) > 0) (24) 

+e(k)P £(k) (S n >0|a (1;1) <0)] 
= (1 - 2e(fc)) liminf P c(fc) (S„ > 0|a (M) > 0) + s(k) 

(25) 

so that 



P s{k) (S„ > 0\a > 0) - P £[k) {S n < 0|a > 0) 
= (1 - 2 £ (A0)A„OMP( fe ))); 



(26) 
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the lim inf „ of the last expression is positive if and only if lim inf „ A n (^ k (Pe ) ) > 
on as e(k) < 1/2. Now, observe that lim inf „ A n (P E (fc) ) > implies 

liminf OTfc A mk {P £ (k) ) > 0, that is liminf„ A„(* fc (P £ (fe) )) > on * k ({-l, l} T(r) ). 

To show the reverse implication, notice that for every level n of T^ r > not of 
the form n = mk we have 

A„(P«) = A n _ 1 (P E W)(pW(,S ri > Q\S n -x > 0) - P e W(5 n < 0|5„_! > 0)) 

(27) 

where if n — 1 = mk then <S n _i = J2 ve T (r) (^k( cr ))f In all cases, the event 
D = {S n -i > 0} is such that a G D satisfies Yl v( zT {r) ° v > ®> tms i m P nes 

P e (fc) (5„ > 0\S n -i > 0) > P e (fc) (S*„ < 0|5„_i > 0) by part ii) of Lemma 2.2 
applied to P £ , since, by J2TJ), the conditional probabilities coincide with those 

0fP E . 

Therefore, computing A n (Pe ) by finite iteration from the maximum level 
mk < n, liminf TO fe A m fe(Pe ) > implies lim inf „ A n (Pi fc ^) > 0. ■ 

Our next aim is to show that e c . r (k) > e c<r , which is to say p c ,r(k) < p c ,r, 
where e c ,r is the critical distortion rate for majority or maximum likelihood 
reconstruction on 

In order to do this we introduce another random transformation, the fraction 
identification transform ■ {— 1> l} T<r> — > { — 1, l} T<r) given by 

§ k (<r) v =<Tv (28) 

if v G T^l, with v = (mk, s\r k + s 2 ), si = 0, r k( - m ~ 1 '> - 1, s 2 = 1, r k , and 
v = (mk, s\r k + S2), S2 = 1, ■■■,r k uniformly chosen at random. Otherwise 

$k(o-)„ = o-v (29) 

As before, for ct G l} T<r) ), let * fc (cr) G T (rfe) be defined by 

*fe(o")« = CT(m,sir*)- (30) 

Now, the strict inequality between the self-corrected critical distortion and 
the original one can be proven. The strict inequality holds for all values of k 
and r except for the one step correction on binary trees. 

Theorem 3.2 // k > 1 or k = 1, r > 2 

£ c ,r(k) > e c y, (31) 
e ,a(l) = e c , 2 (32) 
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To prove this fact, we explicitly compute the noise change under the frac- 
tion identification. On 1, 1} T ('')) the probability distribution P e (k) which 
implements the fraction transform is defined as P^ with e(k) replaced by 

i{k) = 1 -^4=1(2^^+4 - !)• Note that * fe (P e (fe) ) = P i(k) on T^). 
We then have 

Lemma 3.3 Ve, Vfc 

1 - 2i(k) = (1 - 2e) k (33) 
therefore the critical distortion £ c ,r{k) — sup{e : liminf n A„ (P e (fc) ) > 0} equals 

Proof. Denote by X k the number of l's at level k. By definition and linearity 
of expected values, 

£k(k) = 1 - ^E £ (X k \a = 1) = 1 - P e (<Tv = Iko = 1) (34) 

(r) 

for every v € T£ ', The last probability refers to a one-dimensional Markov 
chain of length k with distortion probability e, and can be easily computed. 
Alternatively, f33|) can be verified by induction, since by the last equality, 1 — 
2e(l) = 1 - 2e and 

e[k) = e{\ - e(k - 1)) + (1 - e)e(k - 1), (35) 

so that 

1 - 2e{k) = (1 - 2e)(l - 2e(k - 1)) = (1 - 2e) k . (36) 

From p], (1 — 2e Cjr ) 2 r = 1 and since ^k{P e {k)) is on , on this second tree 
criticality is identified by (l-2e c r k) 2 r k = 1 and {33]) implies (1 - 2e c ,r(k)) 2 r k = 
((1 - 2e c , r ) k ) 2 r k = ((1 - 2s c , r ) 2 r) k = 1 . So e c , r (fc) = e c , r ,. ■ 

Proof of Theorem 3.2 Introduce 

r k -l 

T *A £ ) = -rJ2 l ( p e(Xk = l\*o = 0) - P e (X k = l\a = 1)) (37) 

1=0 

when r is odd, and 

k 

■^2 1 

T k ,r(z) = ^Y, 1 ^ Xk = l \ a ° = °) - P ^ Xk = ^ = X )) ( 38 ) 

2=0 

when r is even. For r odd, we have 
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T k , r (e) = ^ J2 (r k -l)P s (X k = r k -l\a Q = 0)-^ ^ lP e (X k = l\a = 1) 

■ _ r*+l i = 

I— 2 

= e(fr) - e(fc) 

(39) 

and, for r even 



r fc)r (e) = ^ ]T (r* - Z)P £ (*fc = r fe - /K = 0) 



r 



1 £ *P £ (X fe - Z|^ = 1) + ^P £ (X fe = y) - 
(=0 

e(A) - e(k) 



(40) 

By Lemma 3.3 e c , r (k) — e c _ r k and T\. 2(^,2) = 0, so it is sufficient to show that 
Tk,r(S c .r) > for the non trivial cases of k and r. Theorem 1.4 in |Tj shows that 
P e {X k = Z|cr = 0) > P £ (X k = l\a = 1) if r k -I > I. To have strict inequality it 
is sufficient to show that P E (X k = l\a Q = 0) > P e {X k = l]cr = !)• This will be 
done by induction in k. We focus on the number i of distortions of ao at the first 
step. The index i runs from to r, but it is convenient to group together the 

z-th and the (r-i)-th terms. Note that P e (X 1 = i\cr = 0) = ( ^ e i (l-e) r -\ 

Assuming i — for r odd and i = | + 1 if r is even and i > i, the terms in 
T kiT can be collected like this 



TuAe) = E ( f J T ^^( £ ) ( 41 ) 

i— j 

with 

r fc , r>i (e) = [e*(l-£) r - i -(l- E )V- i ] 

• [iP £ (X fe _! = 1|ct = l)(P e (Xjk_ x = 0|a = I))*" 1 
•(P £ (X fe _i - OK = 0)) M + (r - »)(P e (X fc _i - 0|a = 1))' 
•P £ (X fe _! - 1|<t = 0)(P £ (AVi = OK = 0)) M - 1 
-tP e (Xjk_! = l|o- - Q)(P e (X*-i = 0|a = O)) 4 " 1 
■(P e (X fc _! = 0|a = 1)) M - (r - z)(P £ (AVi = 0|a = 0j) 1 
■P £ (X k ^ = l\a = l)(P e (X fe _! = 0|a = l))^- 1 ] 
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(42) 



Now, the first factor is negative if e G (0, 1/2) in particular if e = e c . r . We now 
show that the second factor is negative as well under the hypothesis that the 
statement is true for k — 1. 

The (r—i) terms of the second addend are greater than or equal to (r — i) < i 
terms taken from the third addend since 

P £ (X fc _! - 0\a = 0) > P £ {X k ^ = Ol^o = 1) (43) 

again by pQ. The remaining (2i — r) terms from the third addend are strictly 
less than (2i — r) < i terms taken from the first since 



P e {X k _ x = 1|<7 = 0)P e (X k _ 1 = OK = 0) 

> P e (X fe _a = l|a = l)P e {Xk-x = 0|a = 1); 

(44) 

in fact, PsiXk-! = 0\a Q = 0) > P e (X k -i = 0\a = 1) follows from pLJ, and 
P e (Xk~i = l\cro = 0) > P £ {Xk-i = l\o~o = 1) follows by the induction hypoth- 
esis. 

Finally, the remaining (r — i) terms in the first addend are greater than or 
equal to the (r — i) terms in the fourth addend again by [I]. 

For r = 2 and k = 2 the statement is true, as, by direct computation, we 
have, for some /(e), 

P e (X 2 = l\a = 0) - P £ (X 2 = l\a = 1) 

= 4(1 - e) 5 e + 2e(l - e)f(e) + 4(1 - e)e 5 - 8(1 - e) 3 - 2e(l - e)f(e) 
= 4 £ (l-e)((l-e) 2 + e 2 ) 2 

(45) 

which is positive for s S (0,1/2). For r > 2 and k = 1 the statement is 
true as well in the same domain as P e {X\ — l|ero = 0) — P s (Xi = l|ao = 1) = 
e(l-e) r - 1 > (l-e)e r - 1 . ■ 

4 Limit of within-descent self-corrected critical 
distortions 

The transmission model we are considering can equivalently be rewritten (see 
jl]) as an Ising model [ip with inverse temperature (3 such that 

tanh(/3) = 1 - 2e (46) 



1.5 



and 

m ,(a Ti >) = i e ^<-->^ (47) 

where [ip is any weak limit of fip,^. In turn, this can be represented as an FK 
model, see |5j. The usual FK parameter p' = 1 — e 2/3 can then be modified on the 
tree, to account also for the number of clusters, to p — ^7 = tanh(/3) = 1 — 2e, 
With H = {0, 1} £ ( T<r) ), where £ (T^) are the length 1 edges of T< r ) and 77 e ff, 
denoting by £(T±ln) the edges of TATn, we have 

",(v«))= n p^-'a-p) 1 -^-'. (48) 

Therefore, the FK model is simply an independent Galton- Watson branching 
process with each descendant generated independently with probability p. The 
relation between v p and ftp is the usual (see [5]) 

M^0= E "(is&i)) Cl{ ) — ) ( 4Q ) 

where ~ means that cr is compatible with 77, i.e., <j^ v <j v V(^v,v) > 0, and 
Cl(r] £ ( T ( r ) equals the number of er's compatible with the given 77, i.e. the 
number of site clusters determined by 1-edges in 77. 

In this section we want to show that £ c ,r(k) — > e c ,r, i.e. p c ,r(k) — > p c ,r and 
the main results will be 

Theorem 4.1 There exist ci,C2 > and a function > 0, lim^oo au = 
such that 

l v _L< Pc{k) <l+^ (50 ) 

r £ 2k r c 2k r 



so that it easily follows 
Corollary 4.2 



lim p c (k) = -. 



The FK representation is thus a process in which each edge e € £(T^) is 
open, i.e. 77,= = 1, independently of all other edges, with probability p. The open 
edges are then just the (randomly selected) error fre edges in the transmission, 
in the sense that, given the configuration of the edges, the signal is generated 
by: 

i) fixing the signal Co at the origin; 
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ii) having the signal transmitted error free through the open edges; 

iii) having the signal chosen at random with equal probability through the 

closed edges. 

Seen globally, the set of vertices of falls apart into maximal connected 
components connected by open edges, and such components are called clusters. 
The cluster containing a vertex v is indicated by C{y). Notice that C(0) de- 
scribes the descendants of a Galton- Watson process with offspring distribution 
Bernoulli of parameters r and p. The configuration of FK edges can also be 

described by some r\ S {0, 1} £ ( T ). 

(r) 

As before, let T„ be the vertices in the n-th generation of the tree. The 

(r) 

vertices of T« also fall apart into "clusters" connected, via the entire tree, by 
open edges (these "clusters" are just the intersection of the clusters of with 
Tn^). Given a configuration r\ € {0,1} £( - T< ^ of open, i.e. value 1, FK edges, 
let Zi — Zi(rj), i = 1, ...,m n — m n (rf), be the clusters oiTn ^ in i], 1 < m n < r n , 
and let = \Z%\. 

Notice that \Pfc(3>fc(cr)) is a configuration of T^ r ) and that on such tree there 
is reconstruction if the FK density p(k) — p r k is such that p 2 k r k > 1 (see [1]). 

On the other hand, by our construction, p r k = 1 — 2P(Sk > 0|<7o = 1), so 
we need a lower bound for this expression. Such lower bound is achieved by 
estimating the size of (7(0) n Tjf ] , which is the set carrying information, and the 
value of Y^i=i where Z[ are independent symmetric random variables taking 
values in {—Zi,Zi}, i.e. distributed as the Z^s. This last sum can be estimated 
via the normal approximation using Berry-Essen estimates of the error. This, 
however, involves second and third moments of Zi, and we need to develop a 
somewhat elaborate bound on these moments since simple ones based on the 
maximum size of Zi are not sufficient. 

Such bounds on the sums of moments of Zi 's are determined in Theorems 
4.2 and 4.3 below, as follows. First, notice that in creating the fc-th generation 
roughly (1 — p)r fc_1 vertices are isolated, thus giving rise to the same number of 
Zi's taking values in {—1, 1}. Therefore, X)"=i z f — cr " k f° r some c > and our 
first two estimates show that this bound is nearly optimal. On the other hand, 
the largest cluster is of size roughly (pr) k , so that zf ~ (pr) 3k = (p 2 r) k (pr 2 ) k < 
(1 — c) k (pr 2 ) k if p 2 r < 1. Our last estimate shows that also this bound is nearly 
optimal. Note that this estimate cannot hold if p 2 r > 1, so that it provides no 
information about the reconstruction regime of the original tree. 

We first need a large deviation result for the size of the set of vertices R n = 
C(0) n Tn\ i.e. for the survival set of the Galton- Watson process in the n- th 
generation. Let P p = P e for p = 1 — 2e. 

Lemma 4.3 Let 7 = logr /log{pr) > 1 and 7* such that I/7 + I/7* = 1 and let 
W = linin^oo TppyJr (see [6]). Indicating by P the distribution of W and by E 
the expected value with respect to P, if pr > 1 then there exist M,ci,C2,cs > 
such that if e > is such that (1 + e) 7 < (pr) 1 / 3 and I € N is such that 



17 



y^yffi-D < ci(pr) 1 / 3 and (1 + e) l /2 > M V 1 with r = max I<F H{x) < oo 
and = a;" 7 log(B r • $(s) = £(e sW ) and S r the Bottcher's 

(see JTffi). then 



P p {\Rl\ > (l + e) l p l r l )<c 2 e c ^ 1+ ^ 



(51) 



for all I E N. 



Proof. By large deviation properties of W, there exists M > such that for 
all x > M 



P(W >x)< exp 



v7 *( 7r )V(7-i) 
for all x. Also, there exist C4, C5 > such that 

\Rr 



P 



(pr) r 



W 



> 1 < C4e C6(pr,) 



>/3 



(52) 



(53) 



for all n, see [7], Theorem 5; the conditions of that result are easily met by 
considering a process with the offspring of R n plus one additional offspring in 
each vertex. Therefore, under the current assumptions, for some c 2 > c 4 + 1 
and all I £ N 



P{\Rn\ > (1 + OVr*) < P 



\Rn 



< c 4 e 



(pr) n 

cr,(pr) l/3 



- W 



+ exp 



> lj +P(W> (l+e) l /2) 
7 *( 7 r)V(7-i) 



< c 2 e 



c 3 (l+s)~<*' 



(54) 

• r _ 1 k 

11 C 3 — 7 *(27*7t) 1 /(t-i) • ■ 

Theorem 4.4 Vp and r wi^ft P p -probability one there exists a constant cj — 
07(77) > such that 

E 2 ? ^ c ^ fe (55) 

i=l 

for all k larger than some k-r(rj). 

Proof. > E C: cnTr^,|c|=il C l 2 = K C ^ ^ : 1^1 = !>l = : 

For every 6 = (*~v,v), *~v £ T/-^, ry& is independently chosen to be with 
probability 1 — p, and in such a case C(v) — {v}. So, by large deviations 
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estimates for r k i.i.d. binary random variables, if C7 = —f-, P(ifc < cyr fc ) < 
e _C3 ~3 £ ~ r * for some C3 > (see, for instance [9]) 

Therefore, X)feli Pp(W '■ ^ — C7r fc ) < J2T=i e~ C!>L ^ Elrk < 00 and by Borel- 



Cantelli the statement holds with P p -probability 1 for large k with c-j = - 



2 • 



Theorem 4.5 Suppose p 2 r < 1 and pr > 1. For every a > i/iere ex«'si 
cs = cg(a) > and, with P p -probability one, a finite ks(v) > such that 

E zHv)<c 8 (l + a) k r k (56) 

i=l 

for all k > k$(r]). 

Proof. Let 7 — > 1 and 7* such that ^ + ^ = 1 and take e\ such 

that (1 + £i) 7 * < (pr) 1 / 3 and (1 + e 1 ) 4 p 2 r < 1. By Lemma 4.1, if n € N and 
V = V(n) C T( r ) is some set of vertices, then, since (1 +£i) 7 * < (pr) 1 / 3 we 
have 



P p (A v (n)) = P p (BveV(n):\C(v)nT^\>(l+E 1 ) n -\ v \(pr) n -\ v \) 
< c 5 e- c ^ 1+£l )"'* < " H " l, 

(57) 

Recursively define Vj and as follows: 

V VI \ S cT(r) M^W log((l + £l)V0 _1 l 

Ki = Ki(n) = < w S J y ' : |f I < din = n — v 



logr 



i-i 
j'=i 



we then have 



log ((1 + e 1 ) A( - 1 ~ d i- 1 ' ) p 2 ( 1 - d i- 1 ' ) r 1 - 2d i- 1 ) \ 

\v\ < djn = n > 

logr J 

(58) 



r din _ 1 



((l + £l )Vrr ! 

1 



((1 + £l )4(l-d J _i) p 2(l-d i _ 1 ) r l-2 ! i J _ i y 
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(59) 



PMvAn)) < ({l + e 1 )Vr)- n c,e-^r^' n{1 - dl \ 
P P {A Vj {n)) < ({l + e 1 )^ 1 - d ^p^ 1 - d ^r^- 2d ^y n 

. C5e -(^) 7 *(i+^r*" (1 ^ j) 



(60) 



On A V] (n) c we have 



2n(l~d j _ 1 ) 



vevj 

< (1 + e 1 y 2n(1 - d J- l) r n . 



(61) 



Note that for j = 2,3, ... 



dj = (1 - rfj-i) l0g(1 ^ )4p2r + dj-i - (1 - d^-Odi + dj_i (62) 

and that d\ £ (0, 1) since (1 + e\) A p 2 r < 1, so that lim^oo dj = 1. 

On the other hand, for the given a > let pi be such that r Pl < 1 + a; then, 

k 



if for any cluster C we let Base{C) = min{fc : C n T^ r) ^ 0}, we have 



C^e^l-*)" C C:Base(C)> { l- Pl )n 

< |TM|r Pl " 

< (l + a) n r n . 

(63) 

Next, take Ji € N such that dj x > (1 — pi). Then 



OO Jl J 1 CO 

n— 1 j= 1 j = 1 n— 1 



■c 5 e-^ 1+ ^' n ' 1 - di) < +oo 

(64) 

since for each j the series is of the form A n e~ B , with A > 1 and i? > 0, thus 
convergent. This implies that, by Borel-Cantelli, Av 1 (n)UAv 2 (n)U...UAv Jli (n) 
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occurs only for a finite number of n's with probability one. Thus, for almost all 
r\ there exists k%{r\) such that for all k > k$(ri), f)jl-i Av } (k) c occurs and this 
implies 



m k (rj) 

= 1 C 



E A - Ei cnT i 



Ji 



< E \cnr^+Y: E . 

C:Base(C)>(l-pi)fc J=l C:Bose(C)eV r 3 - 

< (l + a) fe r fe + (l+ei)- 2fe(1 - d 'iV fc Ji 

< c 8 (l + a) fe r fc 

(65) 

for a suitable cs = c&(Ji). I 

Theorem 4.6 7/ p 2 r < 1 and pr > 1, i/ien t/iere exisi a' > 0, eg > and, with 
Pp-probability one, a finite kg(r]) > such that for every a' < a' 

m k (r)) 

]T zf<c 9 (l-a') k (pr 2 ) k (66) 

/or a// fc > ^39(77). 

Proof. We proceed as in the proof of Theorem 4.2 by taking s\, Vj, Ay^n). 
On Ay. (n) c we now have 

E|C(w)nT«| 3 < ^"((l + eijpr) 3 ^ 1 -^- 1 ) 
veVj 

< (1 + e 1 ) _ "( 1_d J- 1 )p™( 1 ~' i 3- 1 )r( 2_dj '- 1 ) n 



< (vr 2 ) n 

~ {{l + ei) 1 - d i-Hpr) d ^ 1 ) nK ' 



(67) 



with dj's defined as above. 

Now, take p 2 > such that p 2 < ^fsf?- Then 



V |cnr«| 3 < Y\cr\T^\ max |cnr«| 

C:Base(C)>(l- P2 )n C K ' ! 

< r n r 2p 2 n 

< r n {pr) n {pr)- n l 2 

< (l-a') n (pr 2 ) n 
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(68) 

provided that 1 — ol > -4= . 

Next, take J 2 € N such that dj 2 > 1 — p 2 and note that the Borel-Cantelli 
Lemma applies as above. Take a' also satisfying 1 — a' > (1 + ei)~^~ dj ^ . 
Then, for k > £9(77), 



mk(rj) 
i=l c 



J2 

1 ! h"> V \ * \n(„\ n tM|3 



(69) 



C:Base(C)>(l-p 2 )k = 1 C:Base(C)£V j 

^ ( 1 -^ fc ^ 2 ) fc+ ( 1+£l ) 1 ( 1 ^) fe ^ 2 ) fc 

< c 9 (l-a') k (pr 2 ) k . 

U 

The next result gives the inequality for critical points p c (k). 

For the lower bound we need 

Lemma 4.8 If Zi's, i — 1, m, are independent random variables each taking 
value in some {—1,1}, I £ N such that Zi e { — 1,1} for alii = 1, then for 
every a > and m > I > we have 

m I 

P(J2z i e[-a,a})<P(^2z i e[-a,a}) (70) 

i=l i=l 



Proof. Since pfe = P£[ =1 Z» = fc) = ^ ^ + ^,^ 2 ) 2 Pfe mcreases U P to 
7/2 and decreases afterwards; then, letting = 5Z i=1 Zj, we have 
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P(S m e [-a, a]) = P(5j e [-a, a], 5 m G [-a, a]) 
+P(Sj ^ [-a.a],^™ e [-a, a]) 
= P(Sje[-a,«],S m e[-a,a]) 

+ Yl E P{S I =t,S m - I = l) 

tf[-a,a] le[-a-t,a-t] 

< P(S, e [-a, a], S m e [-a, a]) (71) 

+ X! E ^/=i+z,5 m _/ = -o 

tg[— a,a] Je[— a— t,a— t] 

= P(Si € [— a, a],S m G [—a, a]) 

+P{Si e [-a, a], 5 m ^ [-a, a]) 
= P(5/e[-a,a]) 



For the upper bound we need an estimate for the error rate s(k) at distance 
k, i.e. the value defined by 



1 - e(k) = P p 



<?v > \a = 1 + JpJ ^ cr,, = |a = 1 



\ 



(r) 



(72) 



Lemma 4.7 // p 2 r < 1 and pr > 1 iften iftere exists Cio > SMc/i iftai /or every 
a > iwift probability one there exists ku finite such that for all k > k\o 



(73) 



Proof. We have 



Pp\ a v >0 



1 + \ P A E ^ = 



cr = 1 



fGT, 



(r) 
Is 



(r) 
A- 
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2 p 





pp 


51 cr t , > 

\ UKz± h 


( 






E ct " 


= 


a = 1, \R k \ < 








+ 


P P 


< J v>0 


E ct « 


= 


ffo = 1,|JZ*| > 









(TO = l,|i?fc| < 



(pr) A 



(74) 



(pr) A 



P P \R k \ < 



(pr) k 



(pr) h 



P P \R k \ > 



(pr) k 



cr = 1 



Notice that for each j] e {-1, 1} s ( tW ), E^,) f« = ^< + |i2*|, with 

Zi symmetric random variables. Therefore, 

p p ( Y °v > oko = i, |i2fc| < ^) + \p p { Y ^ = °ko = i, \Rk\ < 



v<£T> 



* E 

rj e {-1, 1} £ ( tM ) 
|P fe | < ^ 



'm k (jj) 



m k (ri) 



pj jr Zi > o i*? + ip„ 53 ^ = o h 



(75) 



For the second part of ([74|) we use that 



P p ( Y *v > Oko = 1, \Rk\ > + \p p ( E a " = °l^o = 1, \Rk\ > ^) 



vet, 



(r) 



m k (ri) 



> E 

r,:|fl. fe (^)|><^ 

+ Pp £ ^ 



1 1 X 

>- + - E p P 



p v Y z i >o\ v \+-p P [ Y z i = a\v 



1=1 



2 2 



T,:|ii fc (r,)|>-^ 



"ifc(»)) 

E z > 



p,(iijfci > ^) 



2 '" ' P P (\R k \ > ^) 



(76) 
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Then 



p P ( E a « > = 1 ) + \ p p( E a " = = x ) 



> 



1 i 

2 + 2 



vETt 



E 



P, 



V-\Rk(ri)\>' 



m k (rj) 

E 3 

i=l 



< ^ 1*7 I P P W (77) 



Since the random variable defined in Lemma 4.1 is absolutely continuous 
and E{W) = 1 (see (BJ), then P(W > |) > 0. Moreover, Jj|^ converges in 
distribution to W, so there exists a non random ki such that for all k >k\ 



(78) 



Av) 



< 



(prf 



r\ ) via the Gaussian ap- 



We then want to estimate P p ^ Yl^i 
proximation using the Berry-Essen estimates of the error. To this extent, we 
will use the results in Theorems 4.2, 4.3 and 4.4 with a of Theorem 4.3 such that 
(l + a)~ 1//2 > 1 — a', with a' < a' and a' determined as in Theorem 4.4. Such re- 
sults hold with Pp-probability one for almost all 77's, and thus it is possible to find 
a non random k 2 such that P p (f] : k 2 > max{kj{r}), k&{r}), kgirj)) > 1 — \P{W > 

|). Let h such that (^)* ^ < -log| and ^ +a)k > 2^(1 

for k > k%. 

If we define the non random constant 



k w = m,ax(ki,k 2 , k 3 ) 



(79) 



and 



M fe =«Ne{-l,l} 



> ;Ur* < £ 2?fa)<cg(l + a)V, 



(pr) A 



]T |Z,f(r,)|<c 9 (l-a') fc (pr 2 ) fc 



(80) 



4 = 1 



then, for k > kn 



P P {M k ) > -P(W > -) > 0. 



(81) 



From l(77l) we then get 
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P P ( E CT « > °l CTo = J ) + ? p p( E CT « = = x ) 



■m k (ri) 

E ^ 

1=1 



•user 



(82) 



which we now estimate using the Gaussian approximation. Given 77, the ^"s 
are independent random variables, so we can substitute them with the equally 
distributed Z+'s, The Berry-Essen Theorem gives 



' m k (r)) 

p \ E 4* 



(pr) k (pr) k 



'm k (ri) 



(pr) (pr) k 
2 2 



1 



(83) 



with < -4^, where V k = ^or(^) - ££\ (,) *? and Sfc = £(|*| 3 ) 



™fc ^3 



i=l 



If 77 € M k , V k < c 8 (l + a)*T fe and 

„nfcA™.2\fc 



(84) 



so that 



P 



m k (ri) 

E z < 

i=l 



< 



(pr) k 



Vc S (l + c) fc r*= 1 _ 



\/c8 (l + o)'r* 

^(l + a) fc / ; 



2tt 



2 /2 da; _^_ (1 _ aWr fc / 2 



> 



3/2 



2^(1 + 0)^ 
for k>k 10 >k 3 - Together with ([ST]). |82|) this implies 



(85) 



p( e ^ > °i^o = i)+^p( E ff « = = 1 )^ \+ 



1 1 p k r k ' 2 



2 2 iu (l + a)fc/2 



(86) 
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with cio = H=P p (Mfe) > 0, for all k > 



10 • 



Proof of Theorem 4.1 From Lemma 4.7, the probability of error free trans- 
mission p(k) = 1 — 2e(k) satisfies 

for the binary transmission problem on T<- r )for k large enough. Therefore, 
there is reconstruction if 



l <p {k)r k /^ Cll (-^=) , (88) 



Pc(k) < ±±?- (89) 



which is to say 



for fc large enough. Let be the smallest a s.t. |89j) holds. Then lirnfc_ >00 <*k 
as required to prove the upper bound of Theorem 4.1. 

Similarly to ([77| we estimate, for j3 > 0, 
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l-e(k) = 




E ^ > 









a = l,\R k \>(l+P) k (pr) k 



( 



E = ° 

•P p (|i? fc |>(l + /3) fe (pr) fc |a = l) 



a = l,|P fe | > (l + /3) fe (pr) fc 



Pj E ^ > 



CT = 1, l^fcj < (1 + P) k (pr) k 



m k (ri) 

E ^ 



i=l 



<(l + /?) fe (pr)' 



E ^ = ° 



E * 



/ 



ff = l,|i2fc| < (l + /3)V) fe , 

<(l+/?)V)>o = l) 



<(l+/3) fe (pr)' 



E 3 



p P E ^ > ° 
\pp\ E cr " = 



(70 = 1,1^1 < (l + /3) fe (pr) fc , 
^o = l,|Pfe| < (l + /3) fe (pr) fe , 



m k {ri) 



>(1+/3)V)' 



E 3 



>(l + /3) fc (pr)' 



•P P (|P fe | <(l + /3)V) fe 



E 



>(l+/5) fc (pr) fc |a = l) 



(90) 

From Lemma 4.3 

P p (|P fe | > (l + (3) k (pr) k \a = 1) < c 5 e C6 ( 1+ « T * fe . 

In the third term, the expression between square brackets is exactly | since 
Zi's are independent and symmetric. 

Next we consider the second term. Assume first pr > 1. Let 7 be the 

(r) 

set of vertices in which are isolated FK clusters. Then, by large devia- 
tions for i.i.d. random variables, P P (|/| < \ (^r 2 ) f k < e~ crk . Moreover, from 



Lemma 4.8, the expression between square brackets in the second term of l|90p 
is bounded above by P p ( J2i=i — ((1 + P)P r ) k ^j i with Z[ i.i.d. symmetric 
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random variables with values in {—1, 1}. In turn, if |/| > (1 — p— e)r k > ^- the 
normal approximation implies that for some Cyx > 0, C13 > 0, 



Pn 



i=l 



< ((I + f3)pr) k j < c 12 ((l + (3) P V^) k + C13 
< ciA fc/2 (l + /3) fc 



(91) 

for a suitable ci large enough, where the last term comes from the Berry-Essen 
error estimate for the random variables Z' iy with |/| > ^- and -^j? < p k r k / 2 . 
Collecting the above estimates we have 



e[k) < e" 



c 12 ((l + 0)p^f + c 13 p k r k l\\ - a') k + 



Therefore, 



< I + i Cl ^ r fc / 2 (l + /3) & . 



(92) 



p(k) < Cl p k r k ' 2 {l + (3) k (93) 

and the condition for non-reconstruction on the rescaled tree T^ r ) becomes 

Cl /r fe (l + /3) fe < 1. (94) 

This implies 



Pc(k) > 



> 



1 



(95) 



(l + [3)c{ /k r ~ c{ /k r 
If, on the other hand, pr < 1, then for small enough f3, (l + (3)pr < 1 and the 



( r) a v = 



00 = 1, 



second term in square brackets of f90|) reduces to \P p ^ X)„ e T' 
but clearly in this case the symmetry is not broken and no reconstruction can 
take place. ■ 

From Theorem 4.1 it is obvious that the critical points p c {k) converge to the 
Ising model critical point. 
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5 Minority removal 



The self-correction mechanism discussed above is not suitable for biological 
transmission, in which offsprings, once generated, cannot be changed. How- 
ever, there is a similar mechanism, which consists of self-correcting a generation 
by removing the elements not belonging to the majority, which could be imple- 
mented in a biological setting. If r > 4 and such minority removal is carried out 
every step in blocks of size M, then in the renormalized tree each (macroscopic) 
vertex has a random number of children larger then or equal to 2, while the 
error rate is estimated as in |(4|) but on a random number of vertices, between 
4r and M; by taking inequalities as done below, one can see that <[5j) still holds 
with minor changes and thus reconstruction is also possible at every e < \ with 
a sufficiently large M. It is also the case that if a within-descent minority re- 
moval is carried out every k generations, only minor changes in the constants 
are needed in Theorem 4.1 and the limit of the critical points is still the Ising 
critical point as in Corollary 4.2. 

This highlights a possibly real but rather particular phenomenon. It looks 
like a bit of information in the parent biological unit is better transmitted, i.e. 
it is more resistant to random transmission errors, if enhanced by regularly 
destroying descendants not belonging to the local majority. From the biological 
point of view this is also likely to improve the functionality of local segments 
(cells or individuals, for instance). However, the minority removal sometimes 
preserves the wrong information, thus creating blocks of mutated descendants, 
a phenomenon similar to tumor formation. In this respect, our findings seem to 
suggest that tumor generation might be intrinsically connected to improvement 
in character transmission. Of course, any such claim must be warranted by the 
study of many bits transmission. 

Back to our single bit model, the minority-removal carried out every step 
by blocks of size M corresponds to first generating a random tree T' M by means 
of a transformation <£>' M analogous to <£>m and then identifying each block (of 
random size between 4p and M) by means of a transformation , analogous 
to ^m- Let P' e M = ^' M {& M {P e )) be the distribution on the resulting random 
tree T' M . 

Similarly, the within-descent minority removal carried out every fc-steps cor- 
responds to generating a random tree T' k by means of a transformation &' k , anal- 
ogous to <£>fc, and then identifying each block (of random size between ^- and 

r ) by means of a transformation ^' k , analogous to * fc . LetPi (fe) =%(& k (P e )) 
be the distribution on the resulting random tree T k . 

Note that T' M and T' k are Galton- Watson trees, since they are random trees 
with an i.i.d. number of offsprings in each vertex. In generating T' M at least M/2 
vertices are preserved in each block of size M; these have at least rM/2 > 2M 
descendants which can be divided into at least 2 blocks of size M (and possibly 
one remaining smaller block). Thus the number of descendants is at least 2. In 
generating T' r on the other hand, at least r k /2 vertices are preserved in each 
block of size r k and each such vertex gives rise to one descendant block, so 
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each block (which is a renormalized vertex) has at least r k /2 (and at most r k ) 
descendants. 

The branching numbers, which on the Galton- Watson trees equal the mean 
offspring number (see [ID]), satisfy then br{T' M ) > 2 and r k /2 < br(T{,) < r k . 

We begin with a Lemma stating that if on a subtree T' C T maximum 
likelihood reconstruction takes place, then it does also on T. 

Lemma 5.1 Given trees T" C T, if maximum likelihood reconstruction takes 
place on T' then it does also on T , i.e. ?/liminf„ A„(Pr') > then liminf ra A„(Pr) > 
0. 

Proof. Let A n = {a n e T n : P{a n \a = +1) > P(a n \a = -1)}, let A' n be 
the same with T n replaced by T' n and let B' = {a' n G T' n : P(a' n \ao = +1) = 
P{a' n \a = -1)}. We know P{A' n \a = +1) - P(A' n \a = -1) > 5 > for some 
S for large n, and we want to show the same for A n . However, denoting by 
p± (-) = p (-\ ± 1) we have P ± (A n n {A' n ) c ) = P T (A c n n A^) by symmetry, and 
for any event C, by definition of A n , 



p+(A„nc7) > P"(4nC) 

P + (A c n nc) < P~(A c n nC). (96) 



Then, 



P+(A n )-p-(A n ) 
= P+(A n n O + P+(A n n «) c ) + P+(A n n s') 
-P-(A, n A' n ) - p-(A n n K) c ) - p-(A n n s') 
= P + (A n n O + P"(^ n a;) + p+(A„ n p') 
-p-(A„ n A' n ) - p + (A c n n O - p-(A„ n B') 
>P + {A n nA' n ) + P + {A c n nA' n ) 
-p-(A n nA' n )-p-(A c n nA' n ) 
= P+{A' n )-p-{A' n ) 



(97) 

from which the result follows. ■ 

The results for minority removal can be summarized as follows. Notice that 
in the proof we use maximum likelihood reconstruction to use Lemma 5.1 and 
get a bound on the critical point; on the other hand, it is shown in jl] that for 
binary tree the critical points for majority or maximum likelihood reconstruction 
coincide. 

Theorem 5.2 
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i) If f > 4, in the minority removal carried out every step with blocks of size 

M, for every e £ [0, 1/2) 3M : VAf > M 

liminf A n (i* M ) >0. 

n ' 

ii) In the within- descent minority removal carried out every k steps if p' c (k) is 

the critical point then with c > as in Theorem 4-1 we have 

1 , 4 A 

-x- < pM < — 

z 2t r C2fc r 

so that 

lira p'ik) = -. 
fc^co r 



Proof, i) In generating T' M at least vertices were preserved in each block of 
size M; these vertices have r^- > 4-^- = 2M descendants which can be divided 
into at least two blocks of size M (and some remaining others, possibly smaller). 
Thus, the number of descendants in the renormalized tree is at least 2. 

On the other hand, the error rate e' M satisfies ((5j) with M replaced by 4f . 
By Lemma 5.1, maximum likelihood reconstruction on T' M follows from that on 

T (M/2) whi( , h ig ensured by 

2(1 - 2e' M f > 2(1 - 2e- c ' M/2 f > 1 (98) 

which is satisfied for large M. 

ii) In generating T' r at least ^- vertices are preserved in each block of size r k ; 
each such vertex gives rise to one descendant block, so the branching number 

k 

of the renormalized tree is at least ^- . 

Also, it is possible to show bounds on the renormalized error free trans- 
mission p'(k) similar to those used to prove Theorem 4.1. By carefully going 
through that proof, one can see that if pr > 1 

p'{kf <2c\p 2k r k {l + l3) k (99) 

as in <f93|l if pr < 1 again p(k) is exponentially small in k and thus there is no 
reconstruction; and, finally 

m>c*j^ii- (100) 

as in JlZj. 

Again by Lemma 5.1 this implies 

1 1 ,„N 4 ^ 

- V — <p' c (k) < -j- 

r (2 Cl ) 2k r c fr 
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and 



lim p'Jk) = -. 
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